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(57) Abstract: A method of coding video frames in a telecommunication system, comprising: forming a video frame of consecutive 
stationary frames, storing the frame reconstruction data of at least one frame as a reference frame and the motion data of earlier coded 
neighbouring blocks, defining by means of the motion data of one or more earlier coded neighbouring blocks the motion data of the 
block to be coded, which neighbouring block is formed by means of the stored reference frame, defining the frame reconstruction 
data and motion data representing the block to be coded, which provide a pre-defined coding efficiency with a predefined picture 
quality. 
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Method and apparatus for transferring video frame in 
telecommunication system 

FIELD OF THE INVENTION 

The invention relates to a method and an apparatus for coding and 
5 decoding a video frame in a telecommunication system, when the video frame 
is made up of a string of consecutive stationary frames, in which the frames to 
be transmitted are typically divided into frame blocks or segments, for instance 
pixel groups, specific for each frame and the data in each frame block typically 
comprises information .ndicating the luminance, colour and location of the 
1 0 frame block. 

BACKGROUND OF THE INVENTION 

Transferring video frames in telecommunication systems, such as 
video phones, video conference systems or Internet connections, is a 
demanding task due to the large amount of data needed to transfer a video 

15 frame, since the more bits are needed to transfer the data, the higher the data 
transmission rate must be. Various methods have been developed to solve 
this problem. To transfer a frame, the frame is usually divided into frame 
blocks whose size is selected to suit the system. Frame block information 
generally comprises information on the luminance, colour and location of the 

20 frame block in the frame itself. The frame block data is compressed by each 
block using a desired coding method. The compression is based on removing 
the less significant data. Compression methods are divided primarily into three 
classes: spectral redundancy reduction, spatial redundancy reduction and 
temporal redundancy reduction. Various combinations of these methods are 

25 typically used in compression. 

A YUV colour model, for instance, is applied to spectral redundancy 
reduction. The YUV colour model utilises the fact that the human eye is more 
sensitive to changes in luminance than in chrominance, i.e. colour. The YUV 
model has one luminance component (Y) and two chrominance components 

30 (U, V). The chrominance components are also sub-sampled. For instance, a 
luminance block according to the H.263 video coding standard is 16 x 16 
pixels and both chrominance blocks, which cover the same area as the 
luminance block, are 8x8 pixels. The combination of one luminance block 
and two chrominance blocks is in this standard called a macro block. 
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To reduce spatial redundancy, a discrete cosine transformation 
(DCT), for instance, is used, in which the block to be compressed is preferably 
8x8 pixels. In a DCT, the pixel presentation of a frame block is transformed 
into a space-frequency presentation. In addition, only the signal frequencies 
5 which exist in a frame block have high-amplitude coefficients, whereas the 
coefficients of the signals that do not exist in a frame block are close to zero. 
DCT is also a lossless transformation and interference is caused to the signal 
in quantizing only. 

Temporal redundancy can be reduced by utilising the fact that 

10 consecutive frames usually resemble each other, so instead of compressing 
each individual frame, motion data of the frame blocks is generated. The basic 
principle is as follows: an as good as possible reference block which has been 
coded earlier is searched for the frame block to be coded, the motion between 
the reference block and the frame block to be coded is modelled and the 

15 calculated motion vector coefficients are transmitted to the receiver. The 
difference between the block to be coded and the reference block is indicated 
as a prediction error component or frame. The problem is to find a reference 
block which produces as good a coding efficiency as possible (a sufficiently 
good picture quality with as small an amount of bits as possible) and a high 

20 calculation capacity, and thus also calculation time, required by the 
comparison. 

BRIEF DESCRIPTION OF THE INVENTION 

It is thus an object of the invention to provide an improved method 
for coding video frames, an improved video coder and an improved subscriber 

25 terminal. The invention is based on using the frame reconstruction data of at 
least one or more reference frames or reference blocks arid of one or more 
earlier coded neighbouring blocks in coding the frame reconstruction data. The 
frame reconstruction data and motion data coded with the method of the 
invention or with another known method is selected for use depending on how 

30 the desired coding efficiency is obtained with a sufficient picture quality. 

This is achieved by a method of coding video frames in a 
telecommunication system, comprising: forming a video frame of consecutive 
stationary frames, storing the frame reconstruction data of at least one frame 
as a reference frame and the motion data of earlier coded neighbouring 

35 blocks, defining by means of the motion data of one or more earlier coded 
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neighbouring blocks the motion data of the block to be coded, which 
neighbouring block is formed by means of the stored reference frame, defining 
the frame reconstruction data of the frame to be coded, selecting for use the 
frame reconstruction data and motion data representing the block to be coded, 
5 which provide a pre-defined coding efficiency with a predefined picture quality. 

This is achieved by a method of coding video frames in a 
telecommunication system, comprising: forming a video frame of consecutive 
stationary frames, storing the frame reconstruction data of at least one frame 
as a reference frame and the motion data of earlier coded neighbouring 

10 blocks, defining the motion data of a frame block to be coded using only the 
neighbouring blocks which have the same reference frame used to code the 
frame reconstruction data of the block to be coded, selecting for use the frame 
reconstruction data and motion data, which provide a pre-defined coding 
efficiency with a pre-defined picture quality. 

15 The invention also relates to a subscriber terminal comprising: 

means for forming a video frame of consecutive stationary frames, means for 
storing the frame reconstruction data of at least one frame as a reference 
frame and the motion data of earlier coded neighbouring blocks, means for 
defining by means of the motion data of one or more earlier coded 

20 neighbouring blocks the motion data of the block to be coded, which 
neighbouring block is formed by means of the stored reference frame, means 
for defining the frame reconstruction data of the frame to be coded, means for 
selecting for use the frame reconstruction data and motion data representing 
the block to be coded, which provide a pre-defined coding efficiency with a 

25 predefined picture quality. 

The invention also relates to a subscriber terminal comprising: 
means for forming a video frame of consecutive stationary frames, means for 
storing the frame reconstruction data of at least one frame as a reference 
frame and the motion data of earlier coded neighbouring blocks, means for 

30 defining the motion data of a frame block to be coded using only the 
neighbouring blocks which have the same reference frame used to code the 
frame reconstruction data of the block to be coded, means for selecting for use 
the frame reconstruction data and motion data, which provide a pre-defined 
coding efficiency with a pre-defined picture quality. 

35 The invention also relates to a video coder comprising: means for 

forming a video frame of consecutive stationary frames, means for storing the 
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frame reconstruction data of at least one frame as a reference frame and the 
motion data of earlier coded neighbouring blocks, means for defining by 
means of the motion data of one or more earlier coded neighbouring blocks 
the motion data of the block to be coded, which neighbouring block is formed 
5 by means of the stored reference frame, means for defining the frame 
reconstruction data of the frame to be coded, means for selecting for use the 
frame reconstruction data and motion data representing the block to be coded, 
which provide a pre-defined coding efficiency with a predefined picture quality. 

The invention also relates to a video coder comprising: means for 

10 forming a video frame of consecutive stationary frames, means for storing the 
frame reconstruction data of at least one frame as a reference frame and the 
motion data of earlier coded neighbouring blocks, means for defining the 
motion data of a frame block to be coded using only the neighbouring blocks 
which have the same reference frame used to code the frame reconstruction 

15 data of the block to be coded, means for selecting for use the frame 
reconstruction data and motion data, which provide a pre-defined coding 
efficiency with a pre-defined picture quality. 

The invention also relates to a computer program which can be 
read by a computer and which executes the method of coding video frames in 

20 a telecommunication system, comprising: forming a video frame of 
consecutive stationary frames, storing the frame reconstruction data of at least 
one frame as a reference frame and the motion data of earlier coded 
neighbouring blocks, defining by means of the motion data of one or more 
earlier coded neighbouring blocks the motion data of the block to be coded, 

25 which neighbouring block is formed by means of the stored reference frame, 
defining the frame reconstruction data of the frame to be coded, selecting for 
use the frame reconstruction data and motion data representing the block to 
be coded, which provide a pre-defined coding efficiency with a predefined 
picture quality. 

30 The invention also relates to a computer program which can be 

read by a computer and which executes the method of coding video frames in 
a telecommunication system, comprising: forming a video frame of 
consecutive stationary frames, storing the frame reconstruction data of at least 
one frame as a reference frame and the motion data of earlier coded 

35 neighbouring blocks, defining the motion data of a frame block to be coded 
using only the neighbouring blocks which have the same reference frame used 
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to code the frame reconstruction data of the block to be coded, selecting for 
use the frame reconstruction data and motion data, which provide a pre- 
defined coding efficiency with a pre-defined picture quality. 

The method and apparatus of the invention provide savings in 
5 calculation time and capacity while the picture quality remains adequate. 

BRIEF DESCRIPTION OF THE FIGURES 

In the following, the invention will be described by means of 
preferred embodiments, with reference to the attached drawings in which 

Figure 1 illustrates an example of a telecommunication system, 
10 Figures 2a to 2c illustrate one estimation process of vectors using a 

reference frame, 

Figure 3 illustrates one estimation process of a motion vector using 
neighbouring blocks, 

Figure 4 is a flow chart which shows method steps of finding a 
15 reference block and predicting a motion vector, 

Figure 5 illustrates an example of finding a reference block and 
predicting a motion vector, 

Figure 6 illustrates an example of a coder, 

Figure 7 illustrates an example of a decoder, 
20 Figure 8 illustrates an example of a subscriber terminal, 

Figure 9 illustrates a second example of finding a reference block 
and predicting a motion vector. 

DETAILED DESCRIPTION OF THE INVENTION 

Figure 1 illustrates in a simplified manner one digital data 

25 transmission system to which the solution of the invention can be applied. It is 
a part of a cellular radio system comprising a base station 104 which is in radio 
contact 108 and 110 with subscriber terminals 100 and 102 which can be 
terminals installed fixedly or in a vehicle or portable terminals. Transceivers in 
the base station are connected to an antenna unit, with which the radio link to 

30 the subscriber terminals is implemented. The base station is further in contact 
with a base station controller 106 which transmits the connections of the 
terminals elsewhere in the network. The base station controller controls in a 
centralised manner several base stations connected to it. A control unit in the 
base station controller takes care of call control, data traffic control, mobility 

35 management, collection of statistics, and signalling. 
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The cellular radio system can also have a connection to the public 
telephone network, in which case a transcoder belonging to the system 
transforms the different digital coding formats of speech used between the 
public telephone network and the cellular radio network to suit each other, for 
5 instance from the 64 kbit/s format of the fixed network to a cellular radio 
network format (e.g. 13 kbit/s) and vice versa. 

It is obvious to a person skilled in the art that the invention can also 
be applied to fixed telecommunication systems, such as the Internet. 

The aim in coding a video frame is to reduce the number of bits 
10 required to transfer the frame. Figures 2a to 2c show in a simplified manner a 
prior art coding method of motion which is based on predicting the motion 
vector of a frame block (the term block is also used in the text) using a 
reference frame stored earlier in memory. This type of coding is called inter- 
coding which means utilising the similarities between the frames in a string of 
1 5 frames in compression coding. 

In the example of Figures 2a to 2c, the reference frame 206 has 
been divided into blocks or segments and coded earlier. Figures 2b to 2c do 
not show frame blocks. The reference frame can be before or after the frame 
to be coded in the string of frames. Figure 2a shows the current frame being 
20 coded 204 which is also divided into blocks, of which only one block 200 is 
shown for clarity's sake. The size of the blocks is N x M. The blocks are 
typically squares, i.e. N = M, but can also be rectangular, for instance, in their 
geometry. The current block 200 and the reference block 214 shown in Figure 
2c need not be geometrically similar, because in reality, too, the filmed objects 
25 change, such as rotate or twist. The size and shape of the block is defined 
suitable for each situation taking into consideration the following matters, for 
instance: a small block size provides a better picture, but reduces the reliability 
of the defined motion vector, because the definition is made by means of a few 
pixels only. More motion vector data must also be generated, if smaller blocks 
30 are used, because then the frame must be divided into a greater number of 
blocks. In addition, fast algorithms in defining motion vectors are more efficient 
when the block size is large. For instance, in some of the current video coding 
standards N = M = 16. 

To estimate the motion, as matching a reference block as possible 
35 is found in the reference frame 206 for the block 200 being coded, this being 
214 in Figure 2c. The suitability of the reference blocks depends on the 
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selection criterion. The purpose of the selection criterion is typically to find a 
block, by means of which it is possible to minimise the number of bits required 
for transferring the block being coded in such a manner that the picture quality 
remains sufficiently good. Thus, the selection criteria can vary depending on 
5 the object of application. 

The search is usually performed in the reference frame 206 in an 
area which is in a frame location corresponding to the block 200 being coded 
in the current frame 204 and in a certain area in the immediate proximity, 
depicted by the search area 210 in Figures 2b and 2c. By limiting the search 

10 area smaller than the entire reference frame, the time used in searching and 
the required calculation is reduced. The size of the search area is defined 
suitable for the object of application, for instance by determining on the basis 
of the frames of the frame string coded earlier, whether the motion is slow or 
fast; if the motion is slow, the search area can be limited smaller than if the 

15 motion is fast. The search is typically conducted in the surroundings of the 
predicted motion vector. 

The location of the frame block is in the example of Figures 2a to 2c 
indicated with coordinates (x, y) in the upper left corner of the block. The x 
coordinate indicates the location of the pixel or block in horizontal direction 

20 and the y coordinate the location of the pixel or block in vertical direction. The 
coordinates of the block being coded are marked as 202. In Figure 2c, the 
best possible reference frame block 214 has been found in the search area 
210. The coordinates (x, y) 208 are those of the location where the block 200 
being coded would be if it had not moved and thus the same as the 

25 coordinates 202. They are shown for the purpose of illustration. The block 212 
depicts the block 200 had it not moved and is also shown for the purpose of 
illustration. The motion of the frame block being coded can now be indicated 
as a change from the coordinates (x+u, y+v) 216 to the coordinates (x, y) 208. 
In general, the coordinates (x, y) are defined as origo, so the motion vector 

30 218 of the block being coded can be indicated as coordinates (u, v). The 
motion vector can also be indicated using definition of motion vector 
coefficients and motion models. 

Figure 3 shows in a simplified manner prior art motion coding 
methods which are based on predicting the motion vector coefficients of a 

35 frame block being coded using earlier coded neighbouring blocks of the frame 
being coded. The neighbouring blocks of the frame block 304 being coded, 
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which have already been coded earlier and which are used to predict the 
motion vector coefficients of the frame block being coded are here referred to 
as prediction candidates. In Figure 3, the prediction candidates are 
neighbouring blocks 300, 302, 306. The prediction candidates 300, 302, 306 
5 are in the example of Figure 3 marked for identification with certain pixel 
locations 308, 310, 312. The prediction candidate 306 is on the left side of the 
block being coded and marked with the pixel location 308, the prediction 
candidate 300 is above the block being coded and marked with the pixel 
location 310 and the prediction candidate 302 is in the top right comer of the 
10 block being coded and marked with the pixel location 312. As seen from 
Figure 3, the pixel locations 308, 310, 312 reside in their blocks in close 
proximity to the block 304 being coded. It should be noted that the number 
and location of the neighbouring blocks with respect to the block being coded 
may differ from those shown in Figure 3. 
15 in the case shown in Figure 3, all three prediction candidates are 

used in predicting the motion vector coefficients of the block 304 being coded. 
The required motion vector coefficients can, for instance, be predicted as a 
median of the motion vector coefficients obtained from these prediction 
candidates. It is also possible to use only one prediction candidate in 
20 predicting the motion vector coefficients of the block 304, in which case the 
coefficients are obtained from the motion vector coefficients of the prediction 
block 306, the motion vector coefficients of the prediction block 300 or the 
motion vector coefficients of the prediction block 302. 

Earlier coded neighbouring blocks that are farther than in the 
25 immediate proximity ca., also be used, but in such a case it should be done 
taking into consideration that the difference in the motion of the blocks usually 
increases while the distance grows. 

Inter- and intra-coding are typically combined block by block in 
coding video frames to obtain the best possible result. An intra-coded frame 
30 can be sent at suitable intervals to prevent the accumulation of possible errors 
caused by the transmission channel from one frame to another, finally causing 
the failure of the decoding of the frame. 

The methods shown in Figures 2a to 2c and 3 can be used both in 
coding and decoding. The method used by the coder with the necessary frame 
35 data is transmitted to the decoder as a bit stream. 
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It should be noted that as shown in Figure 3, the blocks can also be 
segmented. Segmentation is preferable in situations where different parts of 
the frame move in different ways, for instance the background remains the 
same while a single object, such as a person, a car or the like, moves. A frame 
5 block can be segmented by dividing it in segments vertically, as in block 300, 
horizontally, of which there is no example, or in both directions, as in block 
306. By segmentation, it is also possible to reduce the number of bits being 
transferred by using a suitable part of another already coded frame. 

When defining the motion vector coefficients of the frame block 
10 being coded, various known models for motion estimation are utilised. 
Different models are suitable for defining different motion, such as presenting 
vertical and horizontal motion or presenting rotation. Such models include the 
affine motion model in which the motion of a pixel, frame segment or block is 
indicated by a formula in which Ax and Ay form the motion vector 



15 



Ax(x, y) = a 0 + a x x + a 2 y 
Ay(x,y) = b 0 + b l x + b 2 y 



wherein a 0 , a lP a 2 , b 0 , b 1 and b 2 are motion vector coefficients and x 
and y are vector components, of which x indicates the location of a pixel or a 
20 block in horizontal direction and y the location of a pixel or a block in vertical 
direction, or the translational model in which the motion of a pixel, frame 
segment or block is indicated by a formula in which Ax and Ay form the motion 
vector 

Ax(x,y) = a 0 ^ 

25 

wherein a 0 and b 0 are motion vector coefficients and x and y are 
vector components. The used motion models are stored in the memory of both 
the coder and the decoder. If several motion models are used in coding, the 
coder selects the model according to the motion (e.g. with upward motion, a 
30 model indicating upward motion is used in coding) and information on the used 
model is transmitted to the decoder together with the motion vector 
coefficients. 

Figure 4 is a flow chart showing method steps for video frame 
coding. In the following description, a frame block refers, for instance, to a 
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block, segment or macro block according to the YUV model formed by pixels. 
The execution of the method starts in block 400. 

In block 402, a video frame is formed of consecutive stationary 
frames. To form the video frame, the stationary frames are sent preferably at a 
5 speed at which the human visual system no longer distinguishes separate 
frames from each other. 

In block 404, the coded reconstruction data of frame blocks (e.g. 
colour and luminance of pixels and the prediction error frame obtained from 
the calculation of motion vector coefficients) are stored into memory. The 
10 frame reconstruction data is coded according to the applied standard, for 
instance H.263 or MPEG. The definition of the motion vector coefficients and 
the prediction error frame is described above. One or more reference frames 
are formed of the frame reconstruction data stored in memory. In addition, the 
motion data of the earlier coded neighbouring blocks, i.e. the motion vector 
15 coefficients and a possibly used motion model, are stored into memory. It is 
most preferable to use neighbouring blocks in close proximity to the block 
being coded, but sometimes it is also possible to use earlier coded blocks 
farther away in the frame. 

In block 406, the motion data of the block being coded is defined by 
20 means of earlier coded neighbouring blocks, preferably using the motion 
vector coefficients of one or more neighbouring blocks. 

In block 408, the reconstruction data of the block being coded is 
defined using a reference frame, for instance. 

The method can be intensified by calculating the motion data of the 
25 frame block being coded using only the neighbouring blocks which have the 
same reference frame that was used to code the frame reconstruction data of 
the block being coded. Similarities often exist between frames in the same 
frame string, so it is probable that either a block corresponding to the block 
being coded is found or a block that is similar enough is found in an earlier 
30 coded frame from which a suitable reference block has already been found for 
an earlier, coded neighbouring block. For instance, motion vector coefficients 
of one or more neighbouring blocks are used and the search area of the 
reference block of the block being coded is restricted in the reference frame to 
the environment defined by the motion vector coefficients of the neighbouring 
35 block. 
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In a situation in which only one inter-coded neighbouring block has 
the same reference frame as the block being coded, the motion vector of this 
neighbouring block is selected as one motion vector candidate, another 
candidate can, for instance, be a median calculated from several neighbouring 
5 block motion vectors. 

It should be noted that the block, which is used in coding the block, 
is coded in the same manner (inter) with which the block being coded will be 
coded, i.e., if the neighbouring block is intra-coded, it has no reference frame. 
If there is no prediction candidate having a reference frame, prior art intra- 
10 coding can be used or all reference frames stored in memory can be checked. 

A reference block is the frame block that the video coder, i.e. coder 
with which the frame reconstruction data of the frame block is coded, uses in 
compression coding. A frame block, which is as matching as possible with the 
block being coded and which was searched from one or more reference 
15 frames that were coded earlier and stored in memory, is selected as the 
reference block, whereby the prediction error frame remains as small as 
possible. 

In block 410, the coding method is selected, with which the frame 
reconstruction data and motion data of the frame block being coded can be 

20 coded most advantageously, i.e. a pre-defined coding efficiency is reached 
with a pre-defined picture quality. It should be noted that as the coding method 
of the block being coded either the above-mentioned coding by means of the 
motion data of the neighbouring blocks is selected, or any prior art method, 
such as inter-coding, in which the block being coded is coded directly by 

25 means of the reference block found from the reference frame. In selecting the 
coding method, the aim may, for instance, be to make the number of bits as 
small as possible while the quality of the picture corresponds to that of a 
normal video picture or to make the number of bits match that of the used 
standard, even though a smaller number of bits was enough to achieve a 

30 picture quality according to the application object. 

Coding efficiency can be estimated by means of a Lagrangian 
function, for instance, which defines the most effective segmentation and 
coding and which is 



35 



L(S k ) = D(S k ) + XR(S k ) 
wherein 



(3) 
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D(S k ) is an error created in despreading the frame, 

R(S k ) is the number of bits required to transfer the frame, and 

X is the Lagrangian constant. 

5 The Lagrangian function depicts the connection between the quality 

of the frame being transferred and the bandwidth required on the transmission 
path; in general, the higher quality picture, the bigger the bandwidth. The 
selection of the most advantageous compression coding is always a 
compromise between the required number of bits and the picture quality. 
10 Arrow 412 depicts the repeatability of the method block by block 

until the frame is coded. It should be noted that the method is preferably 
repeated until all frames in the string of frames are coded. 

The execution of the method ends in block 414. 
Figure 5 illustrates an example of predicting motion vectors and 
15 finding a reference block, when only the neighbouring blocks are used, which 
have the same reference frame that was used in coding the frame 
reconstruction data of the block being coded. It should be noted that that in 
coding the block, prior art methods are usually also used. In the end, the 
coding method is selected for use, which provides the desired coding 
20 efficiency with a sufficient picture quality. 

The frame page 500 being coded is divided into blocks, of which 
only a few are shown for clarity's sake. Block 506 is the one being coded. 
Frames 502, 504, 532 and 508 are reference frames stored in memory. The 
reference frame of blocks 510 and 514 of the earlier coded neighbouring 
25 blocks, marked as number 1 , is the frame 502. The block 506 being coded and 
the reference blocks 510 and 514 have not moved in the frame 500 in 
comparison with the frame 502. The reference frame of the neighbouring block 
512, marked as number 2 in the figure, is the frame 504 and the reference 
frame of the neighbouring block 516, marked as number 4 In the figure, is the 
30 frame 508. The blocks in the frame 500 have moved in comparison with the 
frames 504 and 508. The frame 532 has not been used in coding any of the 
neighbouring blocks of the block 506 being coded, i.e. it is not a reference 
frame of any neighbouring block, so the frame 532 is not used in this example 
to find a reference block for the block 506 being coded. 
35 Next, the neighbouring blocks 510 and 514 are used as prediction 

candidates of the motion vectors of the block 506 being coded. A check is 
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made to see whether the coding type (inter) of the prediction candidates is the 
same as the one to be used in coding the current block. If this is so, the block 
526 is used as a reference block and the motion vector coefficients are 
obtained by calculating an average or median of the motion vector coefficients 
5 which have been obtained by comparing the location of the blocks 510 and 
518, from which the motion vector coefficients have been obtained for the 
neighbouring block 510, and the location of the blocks 514 and 520, from 
which the motion vector coefficients have been obtained for the neighbouring 
block 514. In addition, a prediction error frame is defined by comparing the 

10 block 506 being coded with the reference block 526. In the example of Figure 
5, the block location in frames 500 and 502 has not changed, so the motion 
vector coefficients are not defined. 

Next, the coding type of the neighbouring block 512 is checked. If it 
is the same as the one to be used in the current block 506, a reference block 

15 is searched for in frame 504 and the motion vector coefficients obtained from 
the comparison of the blocks 512 and 522 are used as the motion vector 
coefficients of the block 506. The prediction error frame is then defined by 
means of the reference block 528. 

Next, the coding type of the neighbouring block 516 is checked. If it 

20 is the same as the one to be used in the current block 506, a reference block 
is searched for in frame 508 and the motion vector coefficients obtained from 
the comparison of the blocks 516 and 524 are used as the motion vector 
coefficients of the block 506. The prediction error frame is then defined by 
means of the reference block 530. 

25 The motion vector coefficients for block 506 can also be defined by 

calculating a median or average of the motion vector coefficients of the blocks 
510, 512 and 514. 

Also, if several motion models are used, the used motion model is 

indicated. 

30 The block 534 is intra-coded, so it does not have a reference frame 

and is thus not used in coding the block 506, if a preferable coding method is 
found otherwise. 

The number and location of the neighbouring blocks and the 
number of the stored reference frames may differ from what is shown in Figure 
35 5. 
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Figure 9 illustrates a second example of predicting motion vectors 
and finding a reference block. In this example, the block being coded is coded 
by means of only one neighbouring block, if only one neighbouring block has 
been inter-coded and this neighbouring block has the same reference frame 
5 as the block being coded. 

The frame 900 being coded is divided into blocks, of which only a 
few are shown for clarity's sake. Block 912 is the one being coded. Frames 
902, 904 and 906 are reference frames stored in memory. The reference 
frame of the earlier coded neighbouring block 914 (marked as 1) is the frame 
10 902 and the reference frame of block 910 (marked as 2) is the frame 904. The 
block 908 is intra-coded, so it does not have a reference frame. 

The example in Figure 9 tests whether for the block 912 being 
coded suitable reference blocks can be found of all the reference frames 
stored in memory, i.e. frames 902, 904, 906 in this example. When testing 
1 5 whether the frame 902 has a suitable reference block, the motion vector of the 
block 914 is used as the motion vector prediction candidate, since the block 
914 is the only one of the three neighbouring blocks, whose reference frame is 
the frame 902. When testing whether the frame 904 has a suitable reference 
block, the motion vector of the block 910 is used as the motion vector 
20 prediction candidate, since the block 910 is the only one of the three 
neighbouring blocks, whose reference frame is the frame 904. The frame 906 
is not a reference frame for any of the neighbouring blocks, so in the case 
described in the example, the motion vector is predicted using a median of the 
blocks 910, 914 and 908, for instance, in which case the motion vector of the 
25 inter-coded block 908 is set to zero. 

It should be noted that in predicting the motion vectors of the block 
being coded, it is possible to use not only the motion data of the neighbouring 
blocks and the reference frame data, but also segmentation information of the 
macro block, for instance, to which the block being coded belongs. The 
30 number and location of the neighbouring blocks and the number of the stored 
reference blocks may differ from that shown in Figure 9. 

Figure 6 shows an example of a coder with which the above- 
mentioned method of coding video frames can be implemented. A design 
principle of video coders is to minimise the prediction error frame E n {x,y) 600 
35 between the block being coded I„(x,y) 620 and the prediction candidate (the 
selected reference block) P n (x,y) 612. The prediction error frame is defined as 
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a difference between the block being coded 620 and the prediction candidate 
612, which is obtained in an adder 632, thus 

E n (x,y) = I a (x 9 y)-P n (x,y). (4) 

5 

The prediction block 612 is obtained by a method according to a 
preferred embodiment of the invention by using a selected reference frame 
and motion vectors. The coefficients of motion vectors Ax(x,y),Ay(x,y) are 

formed in the motion vector calculation block 630 as a sum of predicted motion 

10 vectors and found difference coefficients. The predicted motion vector 
coefficients are, according to the preferred embodiment of the invention, 
obtained using the motion vectors of the neighbouring blocks which have the 
same reference frame as the block being coded. Other methods are also used 
in predicting motion vectors, one of them is a prior art method in which the 

15 block being coded is coded directly by means of a reference frame. The group 
of the motion vectors of all pixels in a frame is called a motion vector field. 
Because there are a great number of pixels in a frame, a frame is in practice 
divided into blocks in video coding and the motion information is transmitted 
block by block to the receiver decoder shown in Figure 7. 

20 At least one motion vector selected from the predicted motion 

vectors or motion vector fields formed of them are coded in a motion vector 
coding block 628. Motion models well known in the art, for instance, of which 
examples are presented above, are used in the coding. The coded motion 
vectors or motion vector fields formed of them are transmitted to a multiplexer 

25 624. 

In block 618, a frame is formed of the frame blocks segment by 
segment or block by block. The reference block or prediction candidate is of 
format 

30 P n (*,y) = Ax(*,y),y+ 4K*,j>)] (5) 

wherein one or more reference frames R$(x,y) 622 are stored in 
memory 610 and wherein x and y are vector components, x showing the 
location of a pixel, frame segment or block in horizontal direction and y in 
35 vertical direction, and Ax and Ay forming the motion vector of a pixel, frame 

segment or block. 
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In block 602, the prediction error frame E„(x,y) 600 is coded, which 
typically is presented as a finite 2-dimensional series transformation, for 
instance by means of a discrete cosine transformation (DCT). The coefficients 
obtained by the DCT transformation are quantized and coded before 
5 transmitting them to the multiplexer 624 and the receiver decoder shown in 
Figure 7. 

The coder also comprises a decoder block 606 in which the 
prediction error frame E n (x,y) is decoded. This prediction error frame E B (x,y) 
608 is added in the adder 634 to the prediction block P„(x,y) 612 and a 

10 decoded block I„(x,y) is obtained as a result, from which a new reference 
block is obtained, or by combining the blocks belonging to the same page, the 
reference frame 614 is obtained and stored in memory 610. Thus both the 
transmitter video coder and the receiver video coder have the same reference 
frame or block for use. 

15 In the multiplexer 624, the information containing the motion vector 

information is combined with the prediction error frame and this frame 
information 616 is transmitted to the decoder (Figure 7). 

The video coder functions described above can be implemented in 
many ways, for instance by software executed on a processor or by hardware, 

20 such as a logic built of separate components or ASIC (application-specific 
integrated circuit). 

Figure 7 shows an example of a decoder, by means of which the 
method described above for reducing temporal redundancy in video frame 
transfer can be implemented. A coded prediction error frame 702 and coded 

25 motion vector coefficients or motion vector field coefficients 712 of a received 
video signal 700 are separated from each other in a demultiplexer 710. 

In the memory 716 of the decoder, there is at least one earlier 
coded neighbouring block and/or a reference frame B^(x,y) 722 divided into 

blocks. The same frame blocks and/or reference frame are stored in the coder 

30 and decoder memory. 

In block 714, a prediction candidate P n (x,y) 718 is formed in the 

same manner as in the coder block 618 (Figure 6). A decoded prediction error 

frame E„(x,y) 706 is formed in block 704. 

In an adder 720, the decoded prediction error frame E„(x,y) 706 

35 and prediction candidate P„(x,y) 718 are added together. This produces a 

decoded block 7„(x,y) 708 of the format 
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l„ (*, y) = P„ (x, y) +E„(x, y) 

= R*[x + te(x,y),y + Ay(x,y)] + E,£x,y) 

wherein one or more earlier coded neighbouring blocks and/or 
5 reference frame R k „(x, y) 722 divided into blocks is stored in memory 716 and 
wherein x and y are vector components, of which x indicates the location of a 
pixel or a block in horizontal direction and y the location of a pixel or a block in 
vertical direction, and Ax and Ay form the motion vector of a pixel, frame 

segment or block. 

10 The video coder functions described above can be implemented in 

many ways, for instance by software executed on a processor or by hardware, 
such as a logic built of separate components or ASIC (application-specific 
integrated circuit). 

Figure 8 illustrates one terminal of a radio system, by means of 
15 which the method of a preferred embodiment of the invention can be 
implemented. A transmitter of a subscriber terminal 824 and a transmitter of a 
radio system network element perform partly the same tasks. The terminal 824 
can, for instance, be a mobile phone or microcomputer including radio parts, 
without limiting the invention thereto. The described terminal comprises an 
20 antenna 800 which is used to transmit and receive signals through a duplex 
filter. The terminal can also comprise several antennas or a multi-antenna 
system. The terminal also comprises a transmitter 802 which amplifies and 
transmits to the antenna a modulated signal, a modulator 804 which 
modulates according to a selected modulation method the carrier with a data 
25 signal containing desired information, a receiver 806 which amplifies the signal 
coming from the antenna and down-converts it to a desired intermediate 
frequency or directly to base band, a demodulator 808 which demodulates the 
received signal so that the data signal can be separated from the carrier. 

The subscriber terminal also has a control block 816 which controls 
30 the operation of the different parts of the terminal, such as a camera 820 and 
video codec 822, and takes the necessary action to process user speech or 
data generated by a user, such as digital signal processing (DSP), digital-to- 
analogue converting and filtering. In addition, in spread-spectrum systems, 
such as WCDMA, the signal spectrum is spread by means of a pseudo- 
35 random spreading code in a transmitter to a broad band and despread in a 
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rece j ver> thus endeavouring to increase the capacity of the channel. The 
control block also performs both coding and decoding, such as channel and 
speech coding. In addition, the control block adapts the signal being 
transmitted and signalling information to match the air interface standard of the 
5 used radio system. The subscriber terminal comprises one or more memories 
of different type which can be parts of other blocks or separate (not shown). In 
the example of Figure 8, the control block also contains a program controlling 
the operation of the terminal. In the example of the figure, the video codec 822 
which codes and decodes the reconstruction data of a frame is separate, but it 
10 can also be a part of the control block 816. 

When transmitting frames, the camera 820 records a digital picture 
into the memory block 818. The camera 820 can also record the picture 
directly into the codec 822. The codec codes a selected frame by means of 
one of the above methods or by a combination of several methods. The 
15 camera 820, codec 822 and memory 818 can reside in the subscriber terminal 
824 or belong to a separate camera unit 826, in which case the camera unit 
826 is connected to the subscriber terminal 824 with an interface 828. 

A user of the subscriber terminal can when necessary view the 
frame on a display 814. The coded frame reconstruction data is transmitted in 
20 the same manner as speech or text data to the telecommunication system and 
transmitted to another phone, for instance. 

The operational blocks of the terminal described above, such as the 
control block and the codec block 822 required for frame coding and decoding, 
can be implemented in many ways, for instance by software executed on a 
25 processor or by hardware, such as a logic built of separate components or 
ASIC (application-specific integrated circuit). The terminal functions shown in 
the figure can also be combined into operational blocks in many ways differing 
from the figure, the codec block 822 can, for instance, be divided in to a coder 
and decoder. 

30 The user interface of the terminal comprises a loudspeaker or an 

ear piece 810, a microphone 812, a display 814 and possibly a keyboard, 

which all are connected to the control block 81 6. 

Even though the invention has been explained in the above with 

reference to examples in accordance with the accompanying drawings, it is 
35 obvious that the invention is not restricted to them but can be modified in many 

ways within the scope of the inventive idea disclosed in the attached claims. 
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The invention can be applied together with several video coding standards, 
such as H.263 or MPEG. 
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CLAIMS 

1. A method of coding video frames in a telecommunication 

system, comprising 

forming a video frame of consecutive stationary frames, 
5 storing the frame reconstruction data of at least one frame as a 

reference frame and the motion data of earlier coded neighbouring blocks, 

defining by means of the motion data of one or more earlier coded 
neighbouring blocks the motion data of the block to be coded, which 
neighbouring block is formed by means of the stored reference frame, 
1 0 defining the frame reconstruction data of the frame to be coded, 

selecting for use the frame reconstruction data and motion data 
representing the block to be coded, which provide a pre-defined coding 
efficiency with a predefined picture quality. 

2. A method of coding video frames in a telecommunication 
1 5 system, comprising 

forming a video frame of consecutive stationary frames, 

storing the frame reconstruction data of at least one frame as a 

reference frame and the motion data of earlier coded neighbouring blocks, 

defining the motion data of the frame block of the frame being 
20 coded by using only the neighbouring blocks which have the same reference 

frame that was used in coding the frame reconstruction data of the block being 

coded, 

selecting for use the frame reconstruction data and motion data 
representing the block to be coded, whicji provide a pre-defined coding 
25 efficiency with a predefined picture quality. 

3. A method as claimed in claim 1 or 2, wherein the reference 
frames are stored as blocks. 

4. A method as claimed in claim 1 or 2, wherein the reference 

frames are stored as frames. 
30 5. A method as claimed in claim 1 or 2, wherein the block being 

coded is coded by means of one neighbouring block only, if only one 

neighbouring block was inter-coded and this neighbouring block had the same 

reference frame as the block being coded. 

6. A method as claimed in claim 1 or 2, wherein the frame 
35 reconstruction data comprises at least one of the following: information 
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indicating the luminance, colour, location and prediction error frame of the 
frame block. 

7. A method as claimed in claim 1 or 2, wherein the motion of the 
frame block is indicated by means of motion vectors. 
5 8. A method as claimed in claim 1 or 2, wherein the earlier coded 

neighbouring blocks are immediate neighbouring blocks of the frame block 
being coded. 

9. A method as claimed in claim 1 or 2, wherein the earlier coded 
neighbouring blocks are not in the immediate proximity of the frame block 

10 being coded, but farther away in the frame formed by frame blocks. 

10. A method as claimed in claim 1 or 2, wherein a prediction error 
frame is obtained by comparing the frame block being coded and the 
reference frame. 

11. A method as claimed in claim 1 or 2, wherein coding efficiency 
1 5 is estimated by means of a Lagrangian function. 

12. A subscriber terminal comprising 

means for forming a video frame of consecutive stationary frames, 
means for storing the frame reconstruction data of at least one 
frame as a reference frame and the motion data of earlier coded neighbouring 
20 blocks, 

means for defining by means of the motion data of one or more 
earlier coded neighbouring blocks the motion data of the block to be coded, 
which neighbouring block is formed by means of the stored reference frame, 

means for defining the frame reconstruction data of the frame to be 

25 coded, 

means for selecting for use the frame reconstruction data and 
motion data representing the block to be coded, which provide a pre-defined 
coding efficiency with a predefined picture quality. 

13. A subscriber terminal comprising 

30 means for forming a video frame of consecutive stationary frames, 

means for storing the frame reconstruction data of at least one 
frame as a reference frame and the motion data of earlier coded neighbouring 
blocks, 

means for defining the motion data of a frame block to be coded 
35 using only the neighbouring blocks which have the same reference frame used 
to code the frame reconstruction data of the block to be coded, 
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means for selecting for use the frame reconstruction data and 
motion data, which provide a pre-defined coding efficiency with a pre-defined 
picture quality. 

14. A subscriber terminal as claimed in claim 12 or 13, wherein the 
5 reference frames are stored as blocks. 

15. A subscriber terminal as claimed in claim 12 or 13, wherein the 
reference frames are stored as frames. 

16. A subscriber terminal as claimed in claim 12 or 13, wherein the 
block being coded is coded by means of one neighbouring block only, if only 

10 one neighbouring block was inter-coded and this neighbouring block had the 
same reference frame as the block being coded. 

17. A subscriber terminal as claimed in claim 12 or 13, wherein the 
frame reconstruction data comprises at least one of the following: information 
indicating the luminance, colour, location and prediction error frame of the 

1 5 frame block. 

18. A subscriber terminal as claimed in claim 12 or 13, wherein the 
motion of the frame block is indicated by means of motion vectors. 

19. A subscriber terminal as claimed in claim 12 or 13, wherein the 
earlier coded neighbouring blocks are immediate neighbouring blocks of the 

20 frame block being coded. 

20. A subscriber terminal as claimed in claim 12 or 13, wherein the 
earlier coded neighbouring blocks are not in the immediate proximity of the 
frame block being coded, but farther away in the frame formed by frame 
blocks. 

25 21. A subscriber terminal as claimed in claim 12 or 13, wherein a 

prediction error frame is obtained by comparing the frame block being coded 

and the reference frame. 

22. A subscriber terminal as claimed in claim 12 or 13, wherein 

coding efficiency is estimated by means of a Lagrangian function. 
30 23. A video coder comprising 

means for forming a video frame of consecutive stationary frames, 
means for storing the frame reconstruction data of at least one 

frame as a reference frame and the motion data of earlier coded neighbouring 

blocks, 
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means for defining by means of the motion data of one or more 
earlier coded neighbouring blocks the motion data of the block to be coded, 
which neighbouring block is formed by means of a stored reference frame, 

means for defining the frame reconstruction data of the frame to be 

5 coded, 

means for selecting for use the frame reconstruction data and 
motion data representing the block to be coded, which provide a pre-defined 
coding efficiency with a predefined picture quality. 

24. A video coder comprising 
1 0 means for forming a video frame of consecutive stationary frames, 

means for storing the frame reconstruction data of at least one 
frame as a reference frame and the motion data of earlier coded neighbouring 
blocks, 

means for defining the motion data of a frame block to be coded 
15 using only the neighbouring blocks which have the same reference frame used 
to code the frame reconstruction data of the block to be coded, 

means for selecting for use the frame reconstruction data and 
motion data, which provide a pre-defined coding efficiency with a pre-defined 
picture quality. 

20 25. A video coder as claimed in claim 23 or 24, wherein the 

reference frames are stored as blocks. 

26. A video coder as claimed in claim 23 or 24, wherein the 
reference frames are stored as frames. 

27. A video coder as claimed in claim 23 or 24, wherein the block 
25 being coded is coded by means of one neighbouring block only, if only one 

neighbouring block was inter-coded and this neighbouring block had the same 
reference frame as the block being coded. 

28. A video coder as claimed in claim 23 or 24, wherein the frame 
reconstruction data comprises at least one of the following: information 

30 indicating the luminance, colour, location and prediction error frame of the 
frame block. 

29. A video coder as claimed in claim 23 or 24, wherein the motion 
of the frame block is indicated by means of motion vectors. 

30. A video coder as claimed in claim 23 or 24, wherein the earlier 
35 coded neighbouring blocks are immediate neighbouring blocks of the frame 

block being coded. 
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31. A video coder as claimed in claim 23 or 24, wherein the earlier 
coded neighbouring blocks are not in the immediate proximity of the frame 
block being coded, but farther away in the frame formed by frame blocks. 

32. A video coder as claimed in claim 23 or 24, wherein a 
5 prediction error frame is obtained by comparing the frame block being coded 

and the reference frame. 

33. A video coder as claimed in claim 23 or 24, wherein coding 
efficiency is estimated by means of a Lagrangian function. 

34. A computer program which can be read by a computer and 
10 which executes a method for coding video frames in a telecommunication 

system, comprising 

forming a video frame of consecutive stationary frames, 

storing the frame reconstruction data of at least one frame as a 

reference frame and the motion data of earlier coded neighbouring blocks, 
1 5 defining by means of the motion data of one or more earlier coded ' 

neighbouring blocks the motion data of the block to be coded, which 

neighbouring block is formed by means of the stored reference frame, 
defining the reconstruction data of the frame to be coded, 
selecting for use the frame reconstruction data and motion data 
20 representing the block to be coded, which provide a pre-defined coding 

efficiency with a predefined picture quality. 

35. A computer program which can be read by a computer and 
which executes a method for coding video frames in a telecommunication 
system, comprising 

25 forming a video frame of consecutive stationary frames, 

storing the frame reconstruction data of at least one frame as a 
reference frame and the motion data of earlier coded neighbouring blocks, 

defining the motion data of a frame block to be coded using only the 
neighbouring blocks which have the same reference frame used to code the 
30 frame reconstruction data of the block to be coded, 

selecting for use the frame reconstruction data and motion data, 
which provide a pre-defined coding efficiency with a pre-defined picture 
quality. 

36. A computer program as claimed in claim 34 or 35, wherein the 
35 reference frames are stored as blocks. 

37. A computer program as claimed in claim 34 or 35, wherein the 
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reference frames are stored as frames. 

38. A computer program as claimed in claim 34 or 35, wherein the 
block being coded is coded by means of one neighbouring block only, if only 
one neighbouring block was inter-coded and this neighbouring block had the 

5 same reference fnme as the block being coded. 

39. A computer program as claimed in claim 34 or 35, wherein the 
frame reconstruction data comprises at least one of the following: information 
indicating the luminance, colour, location and prediction error frame of the 
frame block. 

10 40. A computer program as claimed in claim 34 or 35, wherein the 

motion of the frame block is indicated by means of motion vectors. 

41. A computer program as claimed in claim 34 or 35, wherein the 
earlier coded neighbouring blocks are immediate neighbouring blocks of the 
frame block being coded. 

15 42. A computer program as claimed in claim 34 or 35, wherein the 

earlier coded neighbouring blocks are not in the immediate proximity of the 
frame block being coded, but farther away in the frame formed by frame 
blocks. 

43. A computer program as claimed in claim 34 or 35, wherein a 
20 prediction error frame is obtained by comparing the frame block being coded 

and the reference frame. 

44. A computer program as claimed in claim 34 or 35, wherein 
coding efficiency is estimated by means of a Lagrangian function. 
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